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Abstract 

Span programs are a model of computation that have been used to design quantum algo¬ 
rithms, mainly in the query model. It is known that for any decision problem, there exists 
a span program that leads to an algorithm with optimal quantum query complexity, however 
finding such an algorithm is generally challenging. 

In this work, we consider new ways of designing quantum algorithms using span programs. 

We show how any span program that decides a problem / can also be used to decide “property 
testing” versions of the function /, or more generally, approximate a quantity called the span 
program witness size , which is some property of the input related to /. For example, using our 
techniques, the span program for OR, which can be used to design an optimal algorithm for the 
OR function, can also be used to design optimal algorithms for: threshold functions, in which 
we want to decide if the Hamming weight of a string is above a threshold, or far below, given 
the promise that one of these is true; and approximate counting, in which we want to estimate 
the Hamming weight of the input up to some desired accuracy. We achieve these results by 
relaxing the requirement that 1-inputs hit some target exactly in the span program, which could 
potentially make design of span programs significantly easier. 

In addition, we give an exposition of span program structure, which increases the general 
understanding of this important model. One implication of this is alternative algorithms for 
estimating the witness size when the phase gap of a certain unitary can be lower bounded. We 
show how to lower bound this phase gap in certain cases. 

As an application, we give the first upper bounds in the adjacency query model on the 
quantum time complexity of estimating the effective resistance between s and t, R st {G). For 
this problem we obtain R s ,t(G)), using O(logn) space. In addition, when p is a 

lower bound on A 2 (G), by our phase gap lower bound, we can obtain an upper bound of 
O | \n^/R Sy t(G)/fij for estimating effective resistance, also using O(logn) space. 

1 Introduction 

Span programs are a model of computation first used to study logspace complexity [KW93], and 
more recently, introduced to the study of quantum algorithms in [RS12], They are of immense the¬ 
oretical importance, having been used to show that the general adversary bound gives a tight lower 
bound on the quantum query complexity of any decision problem [Rei09, Reill]. As a means of de¬ 
signing quantum algorithms, it is known that for any decision problem, there exists a span-program- 
based algorithm with asymptotically optimal quantum query complexity, but this fact alone gives 
no indication of how to find such an algorithm. Despite the relative difficulty in designing quantum 
algorithms this way, there are many applications, including formula evaluation [RS12, Reill], a 
number of algorithms based on the learning graph framework [Bell2b], sf-connectivity [BR12] and 
^-distinctness [Bell2a]. Although generally quantum algorithms designed via span programs can 
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only be analyzed in terms of their query complexity, in some cases their time complexity can also be 
analyzed, as is the case with the quantum algorithm for st-connectivity. In the case of the quantum 
algorithm for ^-distinctness, the ideas used in designing the span program could be turned into a 
quantum algorithm for 3-distinctness with time complexity matching its query complexity up to 
logarithmic factors [BCJ + 13]. 

In this work, we consider new ways of designing quantum algorithms via span programs. Con¬ 
sider Grover’s quantum search algorithm, which, on input x € {0, l} n , decides if there is some 
i € [n] such that Xi = 1 using only 0{y/n) quantum operations [Gro96]. The ideas behind this 
algorithm have been used in innumerable contexts, but in particular, a careful analysis of the ideas 
behind Grover’s algorithm led to algorithms for similar problems, including a class of threshold 
functions: given x € {0, l} n , decide if |cc| > t or |x| < et , where |x| denotes the Hamming weight; 
and approximate counting: given x € {0, l} n , output an estimate of |a;| to some desired accuracy. 
The results in this paper offer the possibility of obtaining analogous results for any span program. 
That is, given a span program for some problem /, our results show that one can obtain, not only 
an algorithm for /, but algorithms for a related class of threshold functions, as well as an algorithm 
for estimating a quantity called the span program witness size, which is analogous to \x\ in the above 
example (and is in fact exactly l/|x| in the span program for the OR function — see Section 2.3). 

New Algorithms from Span Programs We give several new means of constructing quantum 
algorithms from span programs. Roughly speaking, a span program can be turned into a quantum 
algorithm that decides between two types of inputs: those that “hit” a certain “target vector”, and 
those that don’t. We show how to turn a span program into an algorithm that decides between 
inputs that get “close to” the target vector, and those that don’t. Whereas as traditionally a span 
program has been associated with some decision problem, this allows us to now associate, with one 
span program, a whole class of threshold problems. 

In addition, for any span program P, we can construct a quantum algorithm that estimates the 

positive witness size, w + (x), to accuracy e in w + (x)W- queries, where W- is the approximate 

negative witness complexity of P. This construction is useful whenever we can construct a span 
program for which w+(x) corresponds to some function we care to estimate, as is the case with 
the span program for OR, in which w+(x) = |^y, or the span from for st-connectivity, in which 

w+(G) = ^R s j(G), where G is a graph, and R s ,t(G ) is the effective resistance between s and t in 
G. We show similar results for estimating the negative witness size as well. 

Structural Results Our analysis of the structure of span programs increases the theoretical 
understanding of this important model. One implication of this is alternative algorithms for esti¬ 
mating the witness size when the phase gap (or spectral gap) of a certain unitary associated with 
the span program can be lower bounded. This is in contrast to previous span program algorithms, 
including those mentioned in the previous paragraph, which have all relied on effective spectral gap 
analysis. We show how the phase gap can be lower bounded by , where A and A{x ) are 

linear operators associated with the span program and some input x, and u m and a max are the 
smallest and largest nonzero singular values. 

In addition, our exposition highlights the relationship between span programs and estimating 
the size of the smallest solution to a linear system, which is a problem solved by [HHL09]. It is not 
yet clear if this relationship can lead to new algorithms, but it is an interesting direction for future 
work, which we discuss in Section 5. 


2 




Application to Effective Resistance An immediate application of our results is a quantum 
algorithm for estimating the effective resistance between two vertices in a graph, R s j(G). This ex¬ 
ample is immediate, because in [BR12], a span program for st-connectivity was presented, in which 
the positive witness size corresponds to R s j(G). The results of [BR12], combined with our new 
span program algorithms, immediately yield an upper bound of 0(^2 n \JR s< t(G)) for estimating 
the effective resistance to relative accuracy e. This upper bound also holds for time complexity, 
due to the time complexity analysis of [BR12]. Using our new spectral analysis techniques, we 
are also able to get an often better upper bound of O (fn^jR S)t (G)/p), on the time complexity of 
estimating effective resistance, where p is a lower bound on A 2 (G), the second smallest eigenvalue 
of the Laplacian. Both algorithms use O(logn) space. We also show that a linear dependence on 
n is necessary, so our results cannot be significantly improved. 

These are the first quantum algorithms for this problem in the adjacency query model. Previous 
results have studied the problem in the edge-list model [Wanl3]. At the end of Section 4, we 
compare the techniques used in [Want3] to those of our algorithms. Classically, this quantity can 
be computed exactly by inverting the Laplacian, which costs 0(m) = 0(n 2 ), where m is the number 
of edges in the input graph. 

Outline In Section 1.1, we describe the algorithmic subroutines and standard linear algebra that 
will form the basis of our algorithms. In Section 2.1, we review the use of span programs in the 
context of quantum query algorithms, followed in Section 2.2 by our new paradigm of approximate 
span programs. At this point we will be able to formally state our results about how to use 
span programs to construct quantum algorithms. In Section 2.4, we describe the structure of span 
programs, giving several results that will help us develop algorithms. The new algorithms from 
span programs are developed in Section 3, and finally, in Section 4, we present our applications to 
estimating effective resistance. In Section 5, we discuss open problems. 

1.1 Preliminaries 

To begin, we fix notation and review some concepts from linear algebra. By C(V, W) we denote 
the set of linear operators from V to W. For any operator A € C(V,W), we denote by colA the 
columnspace, rowA the rowspace, and ker A the kernel of A. 

Definition 1.1 (Singular value decomposition). Any linear operator A € C{V, W) can be written as 
A = ^T =1 (Ti\ipi)((j>i\ for positive real numbers Oi, called the singular values, an orthonormal basis 
for rowA, {\(fi)}i, called the right singular vectors, and an orthonormal basis for colA, 
called the left singular vectors. We define er m j n (A) := min, a t and cr max (A) := maxjCTj. 

Definition 1.2 (Pseudo-inverse). For any linear operator A with singular value decomposition 
A = Y, r i=i°iWi)(<t>i\, we define the pseudo-inverse of A as A + := Y^i=i — We note that 

A + A is the orthogonal projector onto rowA, and AA + is the orthogonal projector onto colA. For 
any |u) € rowA, the unique smallest vector |rc) satisfying A\w) = |u) is A + \v). 

The algorithms in this paper solve either decision problems, or estimation problems. 

Definition 1.3. Let f : X C [q] n —>• {0,1}. We say that an algorithm decides / with bounded 
error if for any x E X, with probability at least 2/3, the algorithm outputs f(x) on input x. 

Definition 1.4. Let f : X C [q] n —>• M>q. We say that an algorithm estimates / to relative 
accuracy e with bounded error if for any x € X, with probability at least 2/3, on input x the 
algorithm outputs f such that |/(x) — f\ < ef(x). 
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We will generally omit the description “with bounded error”, since all of our algorithms will 
have bounded error. 

All algorithms presented in this paper are based on the following structure. We have some 
initial state |</>o), and some unitary operator U, and we want to estimate ||IIo|<fo)||j where IIo is 
the orthogonal projector onto the 1-eigenspace of U. The first step in this process is a quantum 
algorithm that estimates, in a new register, the phase of U applied to the input state. 

Theorem 1.5 (Phase Estimation [Kit95, CEMM98]). Let U = YljLi e%8j \' l Pj}(' , Pj\ be a unitary, 
with € (—7r,vr]. For any 0 € (0, 7r) and e € (0,1), there exists a quantum algorithm 

that makes O (^log^) controlled calls to U and, on input \ipj), outputs a state \ifj)\w) such that 
if dj = 0, then |w) = |0), and if \6j\ > ©, |(0|cu)| 2 < e. If U acts on s qubits, the algorithm uses 
0(s + log q) space. 

The precision needed to isolate IIo|</>o) depends on the smallest nonzero phase of U, the phase gap. 

Definition 1.6 (Phase Gap). Let {e l<1 be the eigenvalues of a unitary operator U, with 
{0j\jaS C (—7T, 7r]. Then the phase gap of U is A (U) := min{|#jj : 0j ^ 0}. 

In order to estimate ||IIo|</>o)|| 2 , given a state |0)IIo|</>o) + |1)(/ — IIo)|</>o), we use the following. 


Theorem 1.7 (Amplitude Estimation [BHMT02]). Let A be a quantum algorithm that outputs 
Y / p(x)|0)| v I , x (0)) + \J\ — p(x)|l)|d' x (l)) on input x. Then there exists a quantum algorithm that 


estimates p(x ) to precision e using O 


l i 




calls to A. 


If we know that the amplitude is either < po or > p\ for some po < pi, then we can use 
amplitude estimation to distinguish between these two cases. 


Corollary 1.8 (Amplitude Gap). Let A be a quantum algorithm that outputs yp(x)|0)|\I' x (0)} + 
y/l — p(x)|l)|'I , x (l)) on input x. For any 0 < p\ < po < 1, we can distinguish between the cases 

p(x) > po and p(x) < p\ with bounded error using O 1 calls to A. 

Proof. By [BHMT02, Thm. 12], using M calls to A, we can obtain an estimate p of p(x) such that 


\p-p{x)\ < 


2 tt v / p(x )(1 -p{x)) 

M + M 2 


with probability 3/4. Let M = 47 r. Then note that for any x\ and xo such that p(x i) < p\ 
and p(x 0 ) > p 0 , we have, using y/p 0 + Pi > {y/Po + y/Pi)/V 2, 


m >2V2 = 2 ^ 2 »— 1 — > 2V2* , 1 = 2 

P0 ~ Pi y/PO - y/Pl yjpfx 0 ) - y/p(x i) P{X 0 ) ~ P{X l) 


If p\ is the estimate obtained on input x\, then we have, with probability 3/4: 


- . 2tv/p(^i)(1-p(^i)) , J 2 , , \FF)(PM - PM) 

Pl - P(l) m + M 2 - p(rtl,+ ^(yFM + v ^0) 


(Po - Pi) 2 

16(po +Pi\ 


On the other hand, if po is an estimate of p{x 0 ), then with probability 3/4: 


~ ^ / X 2vr v / p(x 0 )(l -p(x 0 )) 
Po > P{X 0 )- 77 - 


_nf_ > , Vp(xq)(p{xo) ~p(x 1 )) 
M 2 ~ P 0 y/2{y/p{xo) + y/p{x 1 )) 


{po-Pi) 2 

16(po +PiY 
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We complete the proof by showing that p\ < po, so we can distinguish these two events. We have: 


Po-Pi > p( x o) ~ p( x i) — 


(p( x o) -P( x i)) 

V2(\/p{ x o) + \/p{ x 1 )) 


(Vp( x o) + Vp( x i)) 


> ( 1 _ ^f) (Po-Pi)-^(Po-Pi) ^ \{PQ-Pi) 


> 0 . 


{Po ~ Pi) 2 

8(po±Pi) 


Thus, using 4 - 7 T = O (pWpf) ca ^ s to ^ we can distinguish between p(x) < pi and p(x) > po 

with success probability 3/4. □ 


In order to make use of phase estimation, we will need to analyze the spectrum of a particular 
unitary, which, in our case, consists of a pair of reflections. The following lemma first appeared in 
this form in [LMR+11]: 

Lemma 1.9 (Effective Spectral Gap Lemma). Let U = ( 211^4 — I)(2T1b — I ) be the product of two 
reflections, and let lie be the orthogonal projector onto span{ |u) : U\u) = e l9 \u), \6\ < 0}. Then if 

u A \u) = o, ||n e n B |«)|| <f |H||. 

The following theorem was first used in the context of quantum algorithms by Szegedy [Sze04]: 

Theorem 1.10 ([Sze04]). Let U = ( 2 IT 4 — I)(2TIb — I) be a unitary on a finite inner product space 
H containing A = span{|^i),..., |V^a.)} and B = span{|</>i),..., |</>{,)}. Let = Ya =1 an d 

U B = Yl\=i |^j) (</>i | • Let D = U A U B be the discriminant of U, and suppose it has singular value 
decomposition Y^j=\ cos dj\ a j)(fij\’ with 0j E [0, ^]. Then the spectrum of U is {e ±2iej }j. The 
1-eigenspace of U is (An B) 0 {A 1 - n B L ) and the —1-eigenspace is (A n B 1 -) 0 ( A - 1 n B). 

Let A a = E“=i l''/ ; i)(/l and ^-b = \ < j > j)(j\- We no ^ e that i n the original statement of 

Theorem 1.10, the discriminant is defined D' = A^A#. However it is easy to see that D' and D 
have the same singular values: if D' = JT a/v/) (iq| is a singular value decomposition of D', then 
D = a i^-A\vi)(ui\A^ B is a singular value decomposition of D, since A a acts as an isometry on 
the columns of D', and A b acts as an isometry on the rows of D'. 

The following corollary to Theorem 1.10 will be useful in the analysis of several algorithms. 


Corollary 1.11 (Phase gap and discriminant). Let D be the discriminant of a unitary U = (2n^ — 
L)(2TIb — I). Then A(—17) > 2a m \ n (D). 

Proof. By Theorem 1.10, if {do = cos#o < 01 = cos$i < ...a m = cos 9 m } are the singular 
values of D , for 9j E [0, ^], then U has phases {±2 9j} 7 JL 0 C [—vr, 7 r], and so —U has phases 
{±29, 0 tt}™ 0 = {±(20,- - vr)}™ 0 C [-tt, tt] . Thus 

A(—17) = min{| 7 r — 29j\ : 9j 7 ^ 7 r/ 2 } = \tt — 2cos -1 minjcrj : a, 7 ^ 0}| = \tt — 2 cos -1 <r m i n (.D)|. 


We have 9 > sin$ = cos( 7 r /2 — 9), so <r m i n (D) > cos(-7t/2 — < 7 m i n (-D)). Then since cos is decreasing 
on the interval [0, 7 t/2 ], we have cos _ 1 (< 7 m i n (Zl)) < tt/2 — cr m i n (D), and thus 


A (-U) > \tt - 2 (tt/2 - <r min (D))| = 2a min (D). 


□ 


5 











2 Approximate Span Programs 

2.1 Span Programs and Decision Problems 

In this section, we review the concept of span programs, and their use in quantum algorithms. 
Definition 2.1 (Span Program). A span program P = A) on [q] n consists of 

1. finite-dimensional inner product spaces H = H\ © • • • 0 H n © i/true © -fffalse; and C 

l ij }//] ji(_ [f/j such that © * * * © ^ , 

2. a vector space V, 

3. a target vector r£h, and 

4■ a linear operator A € C(H, V). 

To each string x € [q] n , we associate a subspace H(x ) := Hi >Xl © ■ ■ ■ © H njXn © H trU e- 

Although our notation in Definition 2.1 deviates from previous span program definitions, the 
only difference in the substance of the definition is that the spaces Hj, a and Hjj, for a b need 
not be orthogonal in our definition. This has the effect of removing log q factors in the equivalence 
between span programs and the dual adversary bound (for details see [Jefl4, Sec. 7.1]). The spaces 
H tine and Pf a i se can be useful for designing a span program, but are never required, since we can 
always add an (n + l) th variable, set x n+ \ = 1, and let H n+ i : o = Pf a i se and H n+ yi = H tIue . 

A span program on [q] n partitions [q] n into two sets: positive inputs, which we call Pi, and 
negative inputs, which we call Pq. The importance of this partition stems from the fact that a span 
program may be converted into a quantum algorithm for deciding this partition in the quantum 
query model [Rei09, Reill]. Thus, if one can construct a span program whose partition of [q] n 
corresponds to a problem one wants to solve, an algorithm follows. In order to describe how a span 
program partitions [q\ n and the query complexity of the resulting algorithm, we need the concept 
of positive and negative witnesses and witness size. 

Definition 2.2 (Positive and Negative Witness). Fix a span program P on [q] n , and a string 
x E [q] n . We say that |rc) is a positive witness for x in P if |w) E H(x), and A\w) = r. We define 
the positive witness size of x as: 

w .|_(®, P) = w + {x) = min{|||u ;)|| 2 : |w) E H{x) : A\w) = r}, 

if there exists a positive witness for x, and w + (x) = oo else. We say that to € £(V, M) is a negative 
witness for x in P if to AH = 0 and tor = 1. We define the negative witness size of x as: 

W-{x, P ) = W-(x) = min{||u;A|| 2 : to E £(V,R) : toAYl H ^ = 0 ,tor = 1}, 

if there exists a negative witness, and W-(x) = oo otherwise. If w + (x) is finite, we say that x is 
positive (wrt. P), and if w-(x) is finite, we say that x is negative. We let P\ denote the set of 
positive inputs, and Pq the set of negative inputs for P. Note that for every x E [q] n , exactly one 
of W-(x) and w + {x) is finite; that is, (Po,Pi) partitions [q] n . 

For a decision problem / : X C [q] n —>• {0,1}, we say that P decides f if / -1 (0) C P 0 and 
f~ l { 1) C Pi. In that case, we can use P to construct a quantum algorithm that decides /. 
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Theorem 2.3 ( [Rei09] ). Fix f : X C [q] n —>• {0,1}, and let P be a span program on [q] n that 
decides f. Let W+(f,P) = max^^-i ^w + (x,P) and W-(f,P) = max ie j^i( 0 ) w-(x, P). Then 
there exists a quantum algorithm that decides f using 0(y/W+(f, P)W-(f, P)) queries. 

We call y/W+(f, P)W-(f, P ) the complexity of P. It is known that for any decision problem, 
there exists a span program whose complexity is equal, up to constants, to its query complexity 
[Rei09, Reill] ([Jefl4, Sec. 7.1] removes log factors in this statement), however, it is generally a 
difficult task to find such an optimal span program. 


2.2 Span Programs and Approximate Decision Problems 

Consider a span program P and x G Pq. Suppose there is some M G H(x) such that A\w) comes 
extremely close to r. We might say that x is very close to being in Pi. If all vectors in H(y) 
for y € Po \ {x} are very far from r, it might be slightly more natural to consider the partition 
(Po \ {x},P\ U {x}) rather than (Po, Pi). 

As further motivation, we mention a construction of Reichardt [Rei09, Sec. 3 of full version] 
that takes any quantum query algorithm with one-sided error, and converts it into a span program 
whose complexity matches the query complexity of the algorithm. The target of the span program 
is the vector 11,0), which corresponds to a quantum state with a 1 in the answer register and Os 
elsewhere. If an algorithm has no error on 1-inputs, it can be modified so that it always ends in 
exactly this state, by uncomputing all but the answer register. An algorithm with two-sided error 
cannot be turned into a span program using this construction, because there is error in the final 
state. This is intuitively in opposition to the evidence that span programs characterize bounded 
(two-sided) error quantum query complexity. The exactness required by span programs seems to 
contrast the spirit of non-exact quantum algorithms. 

This motivates us to consider the positive error of an input, or how close it comes to being 
positive. Since there is no meaningful notion of distance in V, we consider closeness in H. 


Definition 2.4 (Positive Error). For any span program P on [q] n , and x G [ q] n , we define the 
positive error of x in P as: 


e+(x) = e+(x, P) := min 


n 


H(xY 


r | w) : A\w) = r 


Note that e + {x. P) = 0 if and only if x G Pi. Any M such that 
min-error positive witness for x in P. We define 


^H(x) ± \ w ) 


2 


e_|_(x) is called a 


w+(x) = w + (x,P) := min 


Mil : A \ w ) = P 


n 


H(x)- 


\W) 



A min-error positive witness that also minimizes ||M || 2 called an optimal min-error positive 
witness for x. 


Note that if x G Pi, then e+(x) = 0. In that case, a min-error positive witness for x is just a 
positive witness, and w+(x) = w+(x). 

We can define a similar notion for positive inputs, to measure their closeness to being negative. 

Definition 2.5 (Negative Error). For any span program P on [q] n and x G [ q] n , we define the 
negative error of x in P as: 

e_(x) = e_(x, P) := min j UwAII^^) || 2 : uj(t) = 1 j . 
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Again, e_(x, P) = 0 if and only if x £ Pq. Any oj such that cj.AIIhy x ) = e_(x, P) is called a 
min-error negative witness for x in P. We define 

W-(x) = ui-(x, P) := min j ||c<jv4|| 2 : uj(t) = 1, llw-AII#^) || 2 = e_(x, P) j . 

A min-error negative witness that also minimizes ||cvA|| 2 is called an optimal min-error negative 
witness for x. 


It turns out that the notion of span program error has a very nice characterization as exactly 
the reciprocal of the witness size: 


Vx G Po> W-(x) 


1 

e+(x)’ 


and 


Vx £ Pi, w + (x) 


1 

e-(x ) ’ 


which we prove shortly in Theorem 2.10 and Theorem 2.11. This is a very nice state of affairs, for a 
number of reasons. It allows us two ways of thinking about approximate span programs: in terms of 
how small the error is, or how large the witness size is. That is, we can say that an input x £ Po is 
almost positive either because its positive error is small, or equivalently, because its negative witness 
size is large. In general, we can think of P as not only partitioning P into (Pq, P\), but inducing an 
ordering on [q] n from most negative — smallest negative witness, or equivalently, largest positive 
error — to most positive — smallest positive witness, or equivalently, largest negative error. For 
example, on the domain {x^,... , x^ 6 )} C [ q] n , P might induce the following ordering: 

x (l) s (2) ! x (3) I x (4) x (5) I x (6) 

i —;-;— - ;—~— 1 -- I -;-;--- 7- 1 - 

increasing positive error/ ' increasing negative error/ 
decreasing negative witness size decreasing positive witness size 


The inputs {x^, x^, x^ 3 )} are in Pq, and W-(x^) < W-(x^) < W-(x^) (although it is gen¬ 
erally possible for two inputs to have the same witness size). The inputs {x^\ x®, x®} are 
in Pi, and w + (x ( -' i ' ) ) > w + (x^) > w + (x^). The span program exactly decides the partition 
({x^, x( 2 \ x® }, {x (4) , x, x^}), but we say it approximates any partition that respects the 
ordering. If we obtain a partition by drawing a line somewhere on the left side, for example 
({x^, x^ 2 )}, (x^ 3 ), x( 4 \x( 5 ), x^}), we say P negatively approximates the function corresponding to 
that partition, whereas if we obtain a partition by drawing a line on the right side, for example 
({x^ 1 ), x^ 2 ), x^ 3 \ x^ 4 ), x^ 5 )}, (x^ 6 )}), we say P positively approximates the function. 

Definition 2.6 (Functions Approximately Associated with P). Let P he a span program on [q] n , 
and f : X C [q] n —>• {0,1} a decision problem. For any A £ (0,1), we say that P positively A- 
approximates / i//" 1 (l) C P 1; and for all x £ f~ 1 ( 0), either x £ Pq, or w+(x,P) > jW+(f,P). 
We say that P negatively A-approximates / i// _1 (0) C P 0; and for all x £ / -1 ( 1), either x £ Pi, 
or w-{x,P) > jW-(f,P). If P decides f exactly, then both conditions hold for any value of X, 
and so we can say that P 0- approximates f. 

This allows us to consider a much broader class of functions associated with a particular span 
program. This association is useful, because as with the standard notion of association between a 
function / and a span program, if a function is approximated by a span program, we can convert 
the span program into a quantum algorithm that decides / using a number of queries related to 
the witness sizes. Specifically, we get the following theorem, proven in Section 3. 







Theorem 2.7 (Approximate Span Program Decision Algorithms). Fix / :IC [ q] n —>• {0,1}, and 
let P be a span program that positively X-approximates f. Define 

W+= W+(f,P) := max w+(x,P) and W- = W-(f,P):= max w-(x,P). 

xef-H i) xef-H o) 

There is a quantum algorithm that decides f with bounded error in O ^ log l^A^) 9 uer * es - 

Similarly, let P be a span program that negatively X-approximates f. Define 

W- = W-(f, P) := max W-(x,P ) and W + = W+(f, P) := max w + (x,P). 

xef - 1 (o) aie/- 1 ! l) 


There is a quantum algorithm that decides f with bounded error in O 


(l_ A )3/2 iog — 


queries. 


With the ability to distinguish between different witness sizes, we can obtain algorithms for 
estimating the witness size. 


Theorem 2.8 (Witness Size Estimation Algorithm). Fix f : X C [q] n —>• M> 0 . Let P be a span 
program such that for all x € X, f(x) = w + (x, P ) and define W- = W-(f, P ) = max xe x ui-(x, P). 

There exists a quantum algorithm that estimates f to accuracy e in O ^7372 \Jw + (x)W-^j queries. 

Similarly, let P be a span program such that for all x € X, f(x) = w-(x,P) and define W + = 
W+(f, P ) = maxj-gx w+{x, P ). Then there exists a quantum algorithm that estimates f to accuracy 


e in O 


^Jw-(x)W + 


queries. 


The algorithms of Theorem 2.7 and 2.8 involve phase estimation of a particular unitary U, as 
with previous span program algorithms, in order to distinguish the 1-eigenspace of U from its other 
eigenspaces. In general, it may not be feasible to calculate the phase gap of U, so for the algorithms 
of Theorem 2.7 and 2.8, as with previous algorithms, we use the effective spectral gap lemma to 
bound the overlap of a particular initial state with eigenspaces of U corresponding to small phases. 
However, by relating the phase gap of U to the spectrum of A and A(x) := ATl H t x \, we show how 
to lower bound the phase gap in some cases, which may give better results. In particular, in our 
application to effective resistance, it is not difficult to bound the phase gap in this way, which leads 
to an improved upper bound. In general we have the following theorem. 


Theorem 2.9 (Witness Size Estimation Algorithm Using Real Phase Gap). Fix f : X C [q] n — >• 
M>o and let P = ( H,V,t,A) be a normalized span program (see Definition 2.12) on [q] n such that 
for all x £ X, f(x) = w+(x,P ) (resp. f(x) = W-{x)). If n > a ^° r a ^ X ^ ^ en 

quantum query complexity of estimating f(x) to relative accuracy e is at most O 

Theorem 2.7 is proven in Section 3.2, and Theorem 2.8 is proven in Section 3.3, and Theorem 2.9 
is proven in Section 3.4. 



2.3 Example 

To illustrate how these ideas might be useful, we will give a brief example of how a span program 
that leads to an algorithm for the OR function can be combined with our results to additionally 


9 










give algorithms for threshold functions and approximate counting. We define a span program P on 
{0, l} n as follows: 


V = M, t = 1, Hi = Hi t i = span{|i)}, H it0 = {0}, A = ^ (z|. 

Z=1 

So we have H = span{|i) : i G [n]} and H(x) = span{|i) : Xi = 1}. It’s not difficult to see that P 
decides OR. In particular, we can see that the optimal positive witness for any x such that |a;| > 0 
is | w x ) = Yli - Xi =l ]^f|*)- The only linear function oj : M —>• R that maps r to 1 is the identity, and 
indeed, this is a negative witness for the string 0 = 0... 0, since H( 0) = {0}, and so ujAH h ^ = 0. 

Let A G (0,1), t G [n], and let / be a threshold function defined by f(x) = 1 if \x\ > t and 
f(x ) = 0 if |x| < At, with the promise that one of these conditions holds. Note that if f(x) = 1, 
then w+(x) = |||^x}|| 2 = A < so W+(f,P) = ~. On the other hand, if f(x) = 0, then 
w + (x) = |^i > ^ = jW+(f, P), so P positively A-approximates /. The only approximate negative 
witness is ui the identity, so we have W- = ||cu^4|| 2 = ||7l|| 2 = n. By Theorem 2.7, there is a 
quantum algorithm for / with query complexity ^_^ 3 /2 \]w + W- = ^_^ 3 y 2 y/n/t. 

Furthermore, since w+(x) = A, by Theorem 2.8, we can estimate ^ to relative accuracy e, and 

therefore we can estimate |x| to relative accuracy 2e, in quantum query complexity -^j 2 y/n/\x\. 

These upper bounds do not have optimal scaling in e, as the actual quantum query complexi¬ 
ties of these problems are jfjy/n/t and \y/nf |x| [BBBV97, BHMT02, BBC + 01], however, using 
Theorem 2.9, the optimal query complexities can be recovered. 


2.4 Span Program Structure and Scaling 

In this section, we present some observations about the structure of span programs that will be 
useful in the design and analysis of our algorithms, and for general intuition. We begin by formally 
stating and proving Theorem 2.10 and Theorem 2.11, relating error to witness size. 

Theorem 2.10. Let P be a span program on [q] n and x € Pq- If |tw) is an optimal min-error 
positive witness for x, and u is an optimal exact negative witness for x, then 


M)t 


n 


H(x)- 


\W) 


n 


H(x)~ 


W 


2 ’ 


and so 


w-(x) 


1 

e+(x)' 


Proof. Let |u)) be an optimal min-error positive witness for x, and u> an optimal zero-error neg¬ 
ative witness for x. We have (uiA)\w) = tor = 1 and furthermore, since uiAII h(x) = 0) we have 

{ujA)TL h i x \±\w) = 1. Thus, write ^ + |tt) such that (tilll^^x|u>) = 0. Define 

|?Uerr) = n.ff(x)x|w)). We have A(\w) — IRerAl'Werr)) = A\w) = t, so by assumption that |tc) has 
minimal error, 


n 


H(x)- 


\W) 


< 


n, 


H(x)- L i. 1^) Ifker A^err)) ^ 1^) hi j. er jy | rc err ) 


n 


(ker A) x I w en 


} , and so we must have |u; e rr) € (kerd)- 1 . Thus, ker(u) e rr| Q kerd, 


SO |||w e rr)|| < n (kerA )X \w e 
so by the fundamental homomorphism theorem, there exists a linear function Q : col A 


such 
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that ooA = (w eTT |. Furthermore, we have lot = uoA\w) = (u)|n ff ( x \x \w) = |r/;) = e+(x), 


so oJ = — 7- 


has oo't = 1. By the optimality of oo, we must have ||wA||" < ||u/A|| , so 


+ \ u ) < 


l H(xA\W) 

e + (x) 


and so |u) = 0. Thus (wl) f = and w-(x) = \\ooA\\ 2 = ^ ~ = ^y- □ 

Theorem 2.11. Let P be a span program on [q] n and x G P\. If |u>) is an optimal exact positive 
witness for x, and Co is an optimal min-error negative witness for x, then 


i . n H (x)(,uA)^ 

\w ) = ' - 2 

CjAA h(x ) 


and so 


w + {x) = 


e_(x) 


Proof. Let Co be an optimal min-error negative witness for x, and define | w') = _ First 

11 ujA n jr (x) 11 

note that | w') G H(x). We will show that \w') is a positive witness for x by showing A\w') = r. 
Suppose r and A\w') are linearly independent, and let a G £(V,M) be such that a(A\w')) = 0 and 
a(r) = 1. Then for any e G [0,1], we have (eoo + (1 — e)a)r = 1, so by optimality of Go, 

||<^4n f/(x) || 2 < ||(ew + (1 - £)a)An H ( x) \\ 2 

= e 2 ||wAn ff(x) || 2 + (1 - e ) 2 ||a^n H(3 .)|| 2 since a(AIi H{x) {C:A)^) = 0 
(l- £ 2 )||cm4n^ (x) || 2 < (l-£) 2 ||aAn^)|| 2 . 

This implies ||<IL4IIff( x ) || < 0, a contradiction, since ||cDAn#( x \|| > 0. Thus, we must have A\w') = 

rr for some scalar r, so Co(A\w')) = rCo(r). We then have Gj(A\w')) = Co anc l so W e 

||w J 4n H (*)|| 

have r = 1, and thus A\w') = r. So |w/) is a positive witness for x. Let |tc) G H(x) be an optimal 
positive witness for x, so |||rc)|| 2 = w+(x). We have 

u)AU H r x \\w) Cot 1 ,, , 2 

{w |w) = —-- 2 = TT-m2 = :- 2 = \W w )ll • 

CjAh H ( X ) CjAUh^ 


Thus |||w/)|| 2 < |||u/)|| |||u>)|| by the Cauchy-Schwarz inequality, so since \iv) is optimal, we must 
have |||ru)|| = |||w/)||. Since the the smallest | w) such that ATl H r x \\w) = r is uniquely defined as 
( au h{x)) + t, we have \w) = Ire'). Thus w + (x ) = |||u;)|| 2 = ||Kc')|| 2 = — -jp = □ 


Positive Witnesses Fix a span program P = A) on [ q] n . In general, a positive witness 

is any \w) €. H such that A\w) = r. Assume the set of all such vectors is non-empty, and let |m) 
be any vector in H such that A\w) = r. Then the set of positive witnesses is exactly 

W := |w) + ker A = {|m} + | h) : \h) G ker A}. 

It is well known, and a simple exercise to prove, that the unique shortest vector in W is A + t, and 
it is the unique vector in IT H (ker A)- 1 . We can therefore talk about the unique smallest positive 
witness, whenever IT is non-empty. 
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Definition 2.12. Fix a span program P, and suppose W = {| h) E H : A\h) = t} is non-empty. We 
define the minimal positive witness of P to be |n>o) € W with smallest norm — that is, |wo) = A + t. 
We define N + (P):=\\\w 0 )\\ 2 . 

Since \wq) € (ker A)- 1 , we can write any positive witness |w) as |wo) + |wq-) f° r some l^o") € ker A. 
If we let T = A _1 (r), then we can write T = span{|wo)} © ker A. 


Negative Witnesses Just as we can talk about a minimal positive witness, we can also talk 
about a minimal negative witness of P: any ujq E £(V,M) such that cvq (t) = 1, that minimizes 
||woA||. We define IV_(P) = min wo:aJO ( T )= 1 ||u;oA|| 2 . Note that unlike |wo), coo might not be unique. 
There may be distinct ujo,cj' 0 £ jC(V, M) that map r to 1 and have minimal complexity, however, 
one can easily show that in that case, iw^A = uj' 0 A , and that the unique globally optimal negative 
witness in col A is 

For any minimal negative witness, ojo, ujoA is conveniently related to the minimal positive 
witness |wo) by ( ui^A )t = jvTfp)’ and -N + (P) = n \p) ■ (We ^ eave this as an exercise, since it is 
straightforward to prove, and not needed for our results). 


Span Program Scaling and Normalization By scaling r to get a new target t' = Bt, we 

can scale a span program by an arbitrary positive real number B, so that all positive witnesses are 
scaled by B, and all negative witnesses are scaled by Note that this leaves W + W- unchanged, 
so we can in some sense consider the span program invariant under this scaling. 

Definition 2.13. A span program P is normalized if N+(P) = N—(P ) = 1. 

Any span program can be converted to a normalized span program by replacing the target with 
t' = jf—. However, it will turn out to be desirable to normalize a span program, and also scale it, 
independently. We can accomplish this to some degree, as shown by the following theorem. 

Theorem 2.14 (Span program scaling). Let P = (H, V, r, A) be any span program on [q] n , and let 
N = || |wo) || 2 for |u>o) the minimal positive witness of P. For (3 E M>o, define P i8 = (. H 13 , , t 13 , A< 8 ) 

as follows, for |0) and |1) two vectors orthogonal to H and V: 

Vj E [n\,a E \q],H? a := H^ a , tff rue = H true 0 span{|i}}, H^ lse = H {a y se 0 span{|0)} 


= V (B span{| i}}, A? = fiA + \t){0\ + ^ + N |£)(1|, t p = \t) + \1) 

Then we have the following: 

• For all x € Pi, w+(x,pP) = -jpW+(x,P) + and w-(x, P /3 ) < fi 2 W-(x, P) + 2; 

• for all x E Po, W-(x,P' 3 ) = j3 2 W-(x, P) + 1 and w+(x,P /3 ) < -j^w+(x, P) + 2; 

• the minimal witness of P 13 is \wq) = pf +N \wo) + 10) + +N 11)? an d \ w q) 

Proof of Theorem 2.14 is postponed to Appendix A, as it consists of straightforward computation. 


= 1. 
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3 Span Program Algorithms 


In this section we describe several ways in which a span program can be turned into a quantum 
algorithm. As in the case of algorithms previously constructed from span programs, our algorithms 
will consist of many applications of a unitary on H, applied to some initial state. Unlike previous 
applications, we will use |iuo), the minimal positive witness of P, as the initial state, assuming P is 
normalized so that |||rco)|| = 1- This state is independent of the input, and so can be generated with 
0 queries. For negative span program algorithms , where we want to decide a function negatively 
approximated by P, we will use a unitary U{P, x ), defined as follows: 

U(P,x) := (2n kerA - I)(2U h{x) - I) = (2n (kerA) x - i)(m H{x)X - I). 

This is similar to the unitary used in previous span program algorithms. Note that (2II ker a — I) is 
input-independent, and so can be implemented in 0 queries. However, in order to analyze the time 
complexity of a span program algorithm, this reflection must be implemented (as we are able to do 
for our applications, following [BR12]). The reflection (2n#( x ) — I) depends on the input, but it is 
not difficult to see that it requires two queries to implement. Since our definition of span programs 
varies slightly from previous definitions, we provide a proof of this fact. 

Lemma 3.1. The reflection (2n#( x ) — I) can be implemented using 2 queries to x. 

Proof. For every i € [n] and a € [</], let = (I — 2n^x ni? .), the operator that reflects every 
vector in Hi that is orthogonal to Hi^ a . This operation is input independent, and so, can be 
implemented in 0 queries. For every i G [n], let {| Ai), ■■ • > IV’tm;)} be an orthonormal basis for H r . 
Recall that the spaces Hi are orthogonal, so we can map \ipij) >->• Then using one query, 

we can map |*)| ifij) i-» \i)\xi)\tpij). We then perform Ri )Xi on the last register, conditioned on the 
first two registers, and then uncompute the first two registers, using one additional query. □ 

For positive span program algorithms , where we want to decide a function positively approxi¬ 
mated by P, or estimate the positive witness size, we will use a slightly different unitary: 

U'(P,x) = (2U H{x) -I)(2U T -I), 

where T = ker A 0 span{|u;o)}, the span of positive witnesses. We have U' = U'(I — 2|n;o)(tco|). 

We begin by analyzing the overlap of the initial state, |u;o), with the phase spaces of the 
unitaries U and U' in Section 3.1. In particular, we show that the projections of \wo) onto the 
0-phase spaces of U and U' are exactly related to the witness size. Using the effective spectral 
gap lemma (Lemma 1.9), we show that the overlap of \wq) with small nonzero phase spaces is 
not too large. Using this analysis, in Section 3.2, we describe how to convert a span program 
into an algorithm for any decision problem that is approximated by the span program, proving 
Theorem 2.7, and in Section 3.3, we describe how to convert a span program into an algorithm 
that estimates the span program witness size, proving Theorem 2.8. 

Finally, in Section 3.4, we give a lower bound on the phase gap of U in terms of the spectra 
of A and A(x) = giving an alternative analysis to the effective spectral gap analysis of 

Section 3.1 that may be better in some cases, and proving Theorem 2.9. 

3.1 Analysis 

Negative Span Programs In this section we analyze the overlap of |u;o) with the eigenspaces of 
U(P, x). For any angle 0 € [0,7r), we define as the orthogonal projector onto the e* e -eigenspaces 
of U(P,x) for which \0\ < 0. 
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Lemma 3.2. Let P be a normalized span program on [q] n . For any x E [ q] n , 

ll n eko)ll 2 < ^-w+(x) + —3—. 

4 w-(x) 

In particular, for any x € P\, ||I1q| xt?o)|| 2 < %-w+(a;). 

Proof. Suppose x € Pi, and let | w x ) be an optimal exact positive witness for x, so II( ker a -,±\vj x ) = 
|wo). Then since II#©.)_l \w x ) = 0, we have by the effective spectral gap lemma (Lemma 1.9): 


|nSk>)lf = 




2 ©2 9 0 2 
< - lllw^f = —W + (x). 


4 4 

Suppose x E -Po and let oj x be an optimal zero-error negative witness for x and | w x ) an optimal 
min-error positive witness for x. First note that II( ker A \±.\w x ) = |wo), so n(j terA \±IlH-( a .)|w a: ) + 
n (ker J 4 )-Ln H(a , ) j.|w a; ) = |w 0 ). Since U H ^± (n H(x) | w x )) = 0, we have, by Lemma 1.9, 

2 

n0n (kerA) xn„ (3;) K> 


n© 


H-(ker A) 1 - (i)- 1 -1 ^x 


n© |w 0 ) - n ( 


(u x A )t 


0 2 llTT 

< — iin 

2 ©2 

_ T 

2 @ 2 IM . 

< T llk 


H(x) \ w x) 
2 


( ker^) ± w _( x ) 

In the last step, we used the fact that = ^H(x) 1 - l^x), by Theorem 2.10. Next note that 

n (kerA)- L kx^) t = faxA)* and U H ^±(u x Ay = {uj x A)\ so U(u> x A) f = {u x A)\ and therefore, 
n©(o; a; A) t = (uj x Ay. Thus: 


n©|w 0 ) - 


(uxA)^ 


W- {x) 


< 


ll n ©ko)ll + 


- 2 - 


02 


©2 


VJ X 


W-(x) W-(x) 


l|n©|w 0 )|| 2 + 


1 


- 2 - 


-(wo|n©(w T A) t < — w+(x) 

1 . + 0 2 


ll n eko)lr + 


W- (, X) W- (x) 

2 1 - 1 


(u x A\w 0 )) ] < — w + {x) 


- 2 - 


W-(x) W-(x) 


0 2 

-(w x r) t < — w+(x) 


lin©K )|| 2 < ^-u>+k) + IZJx)' 


where in the last line we used the fact that uj x t = 1. 

Lemma 3.3. Let P be a normalized span program on [q] n . For any x E [ q] n , 

l|n;K)ll 2 = 


□ 


W-(x) 

In particular, for any x E P\, ||IIq|wo)|| = 0. 

Proof. By Lemma 3.2, we have ||II(j|u>o) || 2 < w ^ ■ To see the other direction, let uj x be an optimal 
zero-error negative witness for x (if none exists, then w-(x) = oo and the statement is vacuously 
true). Define |u) = {u x A)^. By the proof of Lemma 3.2, U\u) = |u). We have (u\wo) = uj x A\wf) = 


uj x t = 1 and |||ri)|| 2 = ||u; x .A|| 2 = w-(x), so we have: ||IIg|u;o)|| 2 > 


IDH 


i«>ir 


K) 


— □ 

W— (x) 
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Positive Span Programs We now prove results analogous to Lemma 3.2 and 3.3 for the unitary 
U'(P, x ). For any angle 0 E [0, it), we define n 0 as the projector onto the 0-phase spaces of U'(P, x) 
for which \6\ < 0. 

Lemma 3.4. Let P be a normalized span program on [q] n . For any x E [ q] n , 


n'eK >|| 2 


©2 

< — W-(x) + 


1 

w + (x)' 


In particular, if x € Po; then 11LIq| ioo) 11 2 < ^-w-(x). 

Proof. If x E Pq, then let u x be an optimal exact negative witness for x, so uj x ATI h ^ x) = 0, and we 
thus have, by the effective spectral gap lemma (Lemma 1.9), 


UqII t (uj> x A^ < ||w xJ 4|| 2 = ^-w-(x). 


11 x 112 

We have uj x AU t = u x A(Ii kerA + |wo)(w 0 |) = u x A\wq)(w q \ = uj x t(w 0 \ = <ro 0 |, so ||H e |w 0 )|| < 

( \ 

—W-(x). 


Suppose x E Pi, and let \w x ) be an optimal zero-error positive witness for x, and ui x an 
optimal min-error negative witness for x. By Theorem 2.11, we have = LI h{x){& x A)^ . Since 

^H(x){do x AIi H ^A.) 1{ = 0, we have, by Lemma 1.9, 


U e U T (u x AU H{x) ^ 

- i^j) 


< 

< 


< 


02 

4 

02 

4 

02 

4 


lo x ah 
11 dl x A 11 ^ 
W-(x). 


H(x) A 


2 


In the last line we used the fact that ILrliUr) = P-h(x) \ w x) = \w x ), so U'\w x ) = | w x ), and thus 
Ilelrca;) = | w x ). 

Note that Cj x AHt = dJxAili^ a + |^o)(^’o|) = &xA\wo)(wo\ = Cj x t{wq\ = (wq\. Thus, we can 
continue from above as: 


IleK) - 


\w x ) 


W. |_(x) 


|n 0 K>|| + 


\W X 


w+(x ; 

TTeK >|| 2 + 


w + {x) 


w. |_(x) rc + (x) 


©2 

< W-(x) 

4 

—x 0 2 

(u;o|n e |i(; x ) < — w-(x) 

9 © 2 

{w 0 \w x ) < —w-(x) 

©2 


| n ©K>|| < -7-w-(x) + 


W. |_(x) ’ 


where in the last line we used the fact that (wq\w x ) = 1. 


□ 
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Lemma 3.5. Let P be a normalized span program on [q] n . For any x E [ q] n , 

||noK )|| 2 = 


w + (x) 


In particular, if x € Pq, then 11 IIq l^o) 11 = 0. 

Proof. By Lemma 3.4, j|rig|u;o )|| 2 < • Let | w x ) = |u;o) + |^o") be an optimal zero-error positive 

□ 


witness for x. Since \w x ) E H(x) D T, U'\w x ) = \w x ), so ||IIq|u;o)||" > 


(U>j;|l»0) i 

lll^lll 2 “ w+{x)' 


3.2 Algorithms for Approximate Span Programs 


Using the spectral analysis from Section 3.1, we can design an algorithm that decides a function that 
is approximated by a span program. We will give details for the negative case, using Lemma 3.2 
and 3.3. A nearly identical argument proves the analogous statement for the positive case, using 
Lemma 3.4 and 3.5 instead. 

Throughout this section, fix a decision problem / on [q] n , and let P be a normalized span 
program that negatively A-approximates /. By Lemma 3.3 and 3.2, it is possible to distinguish 
between the cases f{x) = 0 , in which w ^ > pp-, and f(x) = 1, in which w ^ < pp- using phase 
estimation to sufficient precision, and amplitude estimation on a 0 in the phase register. We give 
details in the following theorem. 


Lemma 3.6. Let P be a normalized X-negative approximate span program for f. Then the quantum 
query complexity of f is at most O ^ \j W+ log ■ 

Proof. Let U(P,x ) = \ I ) (Vb1 5 and let \wq) = Y1J'=] a jVl , j)- Then applying phase esti¬ 
mation (Theorem 1.5) to precision 0 = \j an d error e = produces a state |wg) = 

YljLi a j\' l l , j)\ U} j) such that if 0j = 0, then | uf) = 10), and if \9j\ > 0 then ||0) | 2 < e. Let Ao be 
the projector onto states with 0 in the phase register. We have: ||Ao|u;g)|| = l a il 2 |(0| a; i)| 2 - 

Suppose x E / _ 1 (0), so ||IIq|u;o )|| 2 = J2j-0 j= o l a j | 2 — w 2(x) ) by Lemma 3.3, and thus we have: 

||A 0 |u'o)|| 2 — 5Z l a il 2 K°l 0 )| 2 = ll n oK)|| 2 >—"7 t — TU - =: Po- 

j'-0j 0 ■' 


On the other hand, suppose x E / 1 (1). Since P negatively A-approximates / and x E / 1 (1), 
w-(x,P ) > \W+{x,P). By Lemma 3.2, we have 


Ilelruo) || 2 <-t-- + —— w + (x, P ) <- 1 - 

ol 0/11 - w-(x,p) 4 J ~W _ 3W_W 4 


Wi 


1 1 + 2A 
3 W- 


and thus 

||Aoko)|| 2 < J2 l a il 2 + 12 l“il 2 |( w ji°)| 2 

Tl®j|<© h|0jl>© 


n eK)l| 2 + £ ^2 I'T'I" — 

r-\8j\>0 


1 + 2A i 1 - A 
3W_ + 6W- 


By Corollary 1.8, we can distinguish between these cases using O 
which costs g log 7 . In this case, we have 


Po ~P 1 



2 \ _ 1 , 1 \ 

3 A 6 ' 6 A 


W.- 


11 - A 
2 W _ ‘ 


calls to phase estimation, 
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The total number of calls to U is: 


1, 1 w_ [wlwl , W_ wJw + w _ 

° g i “ y/WI(l- A)V !- A S r^A “ (I-A ) 3 / 2 log l^A' D 

In addition to wanting to extend this to non-normalized span programs, we note that this 
expression is not symmetric in the positive and negative error. Using Theorem 2.14, we can nor¬ 
malize any span program, while also scaling the positive and negative witnesses. This gives us the 
following. 


Corollary 3.7. Let P be any span program that negatively A- approximates f. Then the quantum 
query complexity of f is at most O f (1 _* )3/2 \JW-(f,P)W+{f,P) log T=a J - 


Proof. We will use the scaled span program described in Theorem 2.14. Let /3 
pP is a normalized span program with 


1 

y/W-{f.P) 


. Then 


1 


W-(f,P^)= max w-(x,P IJ ) = /3 max w-(x, P) + 1 = —— W- + 1 = 2. 
' xef-po) xef-p 0 ) W- 


and W + (f,P l3 )= max w+(x, P 13 ) < —^ max uj+(x, P) + 2 = W-(f, P)W+(f, P) + 2. 


xef-pi) 


f3 2 x&f-pi) 


If we define A^ := . xef 1(0) — ! ’2 = f 2 ^ = t^t, then clearly Ph negatively A 

mm^z-lfi) w~(x,PP) p 2 fW-{f,P )+1 ±+l ’ J a j 

approximates /, so we can apply Lemma 3.6. We have = yzr so we can decide / 

1 A 1- T+a 

in query complexity (neglecting constants): 


1 + A 
1 - A 


2 ( W-(f, P)W+(f, P) + 2 ) log 2 


,1±A 

'1-A 


(l-A)l 


W-(f, P)W+(f, P) log 


1 - A 


□ 


By computations analogous to Lemma 3.6 and Corollary 3.7 (using /3 = yTF+), we can show 
that if P positively A-approximates /, then / has quantum query complexity O ^ 1 _^ 3 / 2 \JW + W- log 
This and Corollary 3.7 imply Theorem 2.7. 


3.3 Estimating the Witness Size 

Using the algorithms for deciding approximate span programs (Theorem 2.7) as a black box, we 
can construct a quantum algorithm that estimates the positive or negative witness size of an input 
using standard algorithmic techniques. We give the full proof for the case of positive witness size, 
as negative witness size is virtually identical. This proves Theorem 2.8. 

Theorem 3.8 (Estimating the Witness Size). Fix f : X C [q] n —>• M>o- Let P be a span program 
on [q] n such that for all x € X, f(x) = w + (x, P). Then the quantum query complexity of estimating 

f to accuracy e is O ^ y/ w +(?)W-(P) ^ . 

Proof. We will estimate e(x) = ^ . The basic idea is to use the algorithm from Theorem 2.7 to 

narrow down the interval in which the value of e(x) may lie. Assuming that the span program is 
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normalized (which is without loss of generality, since normalizing by scaling r does not impact rel¬ 
ative accuracy) we can begin with the interval [0,1]. We stop when we reach an interval [e m ; n , e max ] 
such that the midpoint e = ema - x + emin satisfies (1 — e)e max < e < (1 + e)e m i n . 

Let Decide(P, w, A) be the quantum algorithm from Theorem 2.7 that decides the (partial) 
function g : Pi —>• {0,1} defined by g(x) = 1 if w+(x) < w and g(x) = 0 if w + (x) > j. We 
will amplify the success probability so that with high probability, Decide returns g(x) correctly 
every time it is called by the algorithm, and we will assume that this is the case. The full witness 
estimation algorithm consists of repeated calls to Decide as follows: 

WitnessEstimate(P, e): 


l e « - i - n e (1) - 2 e « - 1 
r. e max — i, e min — u, eq — 3 , e 0 — 3 

2. For i = 1,2,... repeat: 

(a) Run Decide(P, w, A) with w = 1/e^ and A = /e(\ 


(b) If Decide outputs 1, indicating w+(x) < w, set = emax and eO' n ' = e 0 . 

(c) Else, set e£t 1} = e^ n and e&£P = ef. 

(d) If emax^ < (1 + return e = tmax ■ 

(e) Else, set ej x+1) = + ^ej^ and ej +1) = + §e£t 1} . 


,(<+!) _ Ji) 


We can see by induction that for every i, eO n < 
12 


^ —-— < emax- This is certainly true for i = 1 , 


- Cmax - 

since w + {x) > |||i^o)II^ = L Suppose it’s true at step i. At step i we run Decide(P, Wi, A*) with 
Wi = 1/e^ and jf = l/e[, l) . If > e^*, then Decide returns 1, so we have € [e^,e^ ax ] = 

If ^jy < 4°, then Decide returns 0 , so we have ^y € [e£J n , e[°] = [ej^, ek« } ]. 
1 € [eo\e^], which is a subset of both [e^ \ emax] and [e ^-„ , e^], so in any case, 


Otherwise, 

l 

W-\-{x) 


w+(x) 
l e miri > e max J • 


(i) (i) 

To see that the algorithm terminates, let Aj = emax — e L„ denote the length of the remaining 
interval at round i. We either have A i+i = etL — = emax — 3 emax - § e mi n = §Aj, or 

Aj + i = - ej^ in = |emax + ge^ in - e^ in = §Aj, so A; = (2/3) l_1 . We terminate at the smallest 

T such that (2/3 ) T_1 = A t = — e| r ^ t \ < (1 + e — l)e ^ < w u . Thus we terminate before 

r = riog 3 / 2 ^ + ii- 


e = ^(emax+e ^ ) be the returned estimate. Recall that we only terminate when e iwL < (l+e)e ^ . 


- ife (T) 
We have 


Next, we show that, assuming Decide does not err, the estimate is correct to within e. Let 

minate when e £T 2 

< (1 + e)w + (x), 

') w +( x ) > (l - |) w+{x). 


and 


min/ 

i 

e 

> 


(T) . (T) 

emax + e min 


< 


emax (l + 


< 


1 


> 


1 


e e m in(l T 1 T e) 
Thus, 11/e — w + (x )| < ew+(x). 


w+(x) 


(1 + e/ 2 ) 


1 

( 2+£ ) 

t!/+(cc) 


(i- 

e /2 


1 + e /2 ^ 
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( V™W- 1. 1 ) T _ 1 /„(*) 

\ v (1_ A )3/S 

e o } / e i ?) be the values used at the * th iteration. Since < e tL < ^ + A*, we have 


By Theorem 2.7, Decide(P, re, A) runs in cost O ^ ^_"^ 3/2 log yry j • Let wg = 1/e /' 1 and Aj = 


,(i) 


1 - 


< 


w+(x) 


+ A i 


A i pW _ p(») _ 2 (i) , ij») _ l _(*) _ 2 (*) re + (x)Aj 

e i e o 3 e m ax-t- 3 e min 3 e m ax 3 e min ’ 


+ 3 — 0( 1/e), 


since Aj = (2/3 )*" 1 > (2/3) 1 ^ 


. Observe 


Vwi 


„(0 


< 


(1-A;) 3 / 2 ( e W_ e ( i ))3/2 - Vu)+(x) 


+ Aj 


A 375 ’ 


so, ignoring the log = 0 (log |) factor, the cost of the i th iteration can be computed as: 
\l WnW- 

Q = 


(1 - A*) 3 / 2 


< \ W- 


1 


w+(x) 


+ A* 


3 _oV W - / 3\5 
— O 


a 3/2 


ry + (x) 


3 (i-i) 


+ 3 \W. 


3 \ |(t-L 


We can thus compute the total cost (neglecting logarithmic factors): 
T 

< 


i= 1 


< o 



W- f w+ ( x ) A 3/2 



( 1)^-1 


(l) 1/2 -l 


using the fact that (2/3) T = 0 • 

Finally, we have been assuming that Decide returns the correct bit on every call. We now justify 
this assumption. At round i, we will amplify the success probability of Decide to 1 — |(2/3)* _1 , 
incurring a factor of log(9(3/2)* _1 ) = O(logLi-Al) i n the complexity. Then the total error is at 
most: 


E^/sr 1 


1 1 - (2/3) 


T—l 


9 


1 - 


-in- 


W+(x) 


< 


1 


i=l ~ 3 

Thus, with probability at least 2/3, Decide never errs, and the algorithm is correct. 


□ 


3.4 Span Program Phase Gap 

The scaling in the error from Theorem 3.8, 1/e 3 / 2 , is not ideal. For instance, we showed in 
Section 2.3 how to construct a quantum algorithm for approximate counting based on a simple 
span program for the OR function with complexity that scales like l/e 3//2 in the error, whereas 
the best quantum algorithm for this task has complexity scaling as 1/e in the error. However, the 
following theorem, which is a corollary to Lemma 3.3 and Lemma 3.5, gives an alternative anal¬ 
ysis of the complexity of the algorithm in Theorem 3.8 that may be better in some cases, and in 
particular, has the more natural error dependence 1 /e. 


Theorem 3.9. Fix f : X C [g] n —>• R >0 . Let P be a normalized span program on [q] n such 
that X C P Q) and for all x € X, w-(x,P) = f(x); and define A(/) = min xeA -A (U(P,x)). 

Then there is a quantum algorithm that estimates f to relative accuracy e using O 

queries. Similarly, let P be a normalized span program such that X C P 1? and for all x € X, 
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w+(x,P) = f(x); and define A'(/) = min xe x A (U'(P,x)). Then there is a quantum algorithm that 
estimates f with relative accuracy e using O ^7 ^ A'(/ 7 ~^ Q uer "i es - 

Proof. To estimate W-(x), we can use phase estimation of U(P,x ) applied to \wq), with precision 
A = A(/) and accuracy e = (V/) , h° wever > this results in logW- factors, and W- may be 

significantly larger than u’-(x). Instead, we will start with e = and decrease it by 1/2 until 

, ~ g 

^ W—(x,P) * 

Let |u!q) be the result of applying phase estimation to precision A = A(/) and accuracy e, and 
let Ao be the projector onto states with 0 in the phase register. We will then estimate 11Ao|W q) 11 2 
to relative accuracy e/4 using amplitude estimation. Since A < A(U(P, x)), we have ||IIq|u;o )|| 2 < 
IIAol^o)II 2 < ll n Al u ’o)l | 2 + e = l|ng|w;o )|| 2 + e- By Lemma 3.3, we have ||ng|w 0 )|| 2 = so we 

will obtain an estimate p of —V-r such that 

1 W— [X) 

ip - ( 1 + i) (^m + e )' 

If p > 2(1 + |)e, then we know that w > e, so we perform one more estimate with accuracy 
e' = |e < and return the resulting estimate. Otherwise, we let e' = e /2 and repeat. 

To see that we will eventually terminate, suppose e < Then we have 

p > (1 - e/A)—^— > (3/4)4e > (3/4)(4/5)(1 + e/4)4e > 2(1 + e/4)e, 

W- (, x) 

so the algorithm terminates. Upon termination, we have 

p < (1 + e/A) f - — + e^ < (1 + c/4) f -—— + -- T~^] — (l + 9 )-tti 

\w-(x) J \w-[x) 8w-(x)J V 2 / w-(x) 

so |1 /p — rc_(x)| < ew-{x). By Theorem 1.5 and 1.7, the total number of calls to U is: 


log Aw— (a?) 


1 V w -(x ) lo£2< + y/w-{x) log W_{x) _ 1 y/w-(x) 


' log 6w— (x) 


i =0 


A 


which is at most log 


accuracy e using O I 


( V w +( x ) 

A'e 


Ae 

2 w~(x) _ 

£ 

calls to U’ 


A 


Y i + lo § 


W-(x) 


i =0 


= O ( )• Similarly, we can estimate w+(x) to relative 


□ 


Theorem 3.9 is only useful if a lower bound on the phase gap of U(P,x) or U'(P,x ) can be 
computed. This may not always be feasible, but the following two theorems shows it is sufficient 
to compute the spectral norm of A, and the spectral gap, or specifically, smallest nonzero singular 
value, of the matrix A(x) = AAljjfx)- This may still not be an easy task, but in Section 4, we show 
that we can get a better algorithm for estimating the effective resistance by this analysis, which, 
in the case of effective resistance, is very simple. 

Theorem 3.10. Let P be any span program on [q\ n . For any x € [q] n , A (U(P,x)) > 
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Proof. Let U = U(P,x). Consider —U = (2n^ ker ^x — I)(2H H ^ — I). By Corollary 1.11, if D 
is the discriminant of —U, then A (U) > 2a m [ n (D ), so we will lower bound (T rn - m {D). Since the 
orthogonal projector onto (ker^4)' i = rowT is A + A, we have D = A + AH h i x \ = A + A(x). 

We have cr m [ n (D) = miri|,^ GrowD , so let |rt) € rowH be a unit vector that minimizes 
||D|'u)||. Since |tt) € rowH C row^4(:r), we have ||T(x)|u}|| > cr m i n (^4(a;)). Since A(x)\u) € colA(x) C 
coM = rowT + , we have 


0 -min (D) = ||A + A(x)|tt)|| > a m \n(A + ) ||A(x)|u)|| > (T m \n (A + )cr min (A(x)) 


< 7min( J 4(3')) 

cr max( J 4) 


since a min (^+) = Thus A (U) > 2 tr ^g ) . 


□ 


Theorem 3.11. Let P be any span program. For any x € P\, A (U'(P,x)) > 2 °'™ m b'|fo)) . 

Proof. We have 

-U'(P,x)i = (2(1 - biker .4©span{|too)}) L)(2II^^ a; ) /) (2(1 IlkerA b[|u,o)) I)(2YL]-[^ X ^ /), 

since |u>o) € (kerA) - * - , so —U'(P,x)^ has discriminant: 


T (H-(ker A)- 1 Hjruo) )^i?(x) H-(ker A) 1 - L^H(x) b^|?0o)^(ker A)- 1 ^H(x) 1 ■luo ) ^ ' 

Since x € Pi, let \w x ) = t!(x) + |t). Then D\w x ) = A + A(x)\w x ) = A + \t) = |ti>o), so |n>o) € coLD. 
Let (|0o) = |wo), |0i), • • •, |0r-i)} be an orthogonal basis for coLD. Then we can write D = 
X)i=o I^K^I for \ v i) = 0 (not necessarily orthogonal). Then D' = Ya =o n^x \4>i){vf\ = 

Ya =i so co[D ’ = span{| 0 i),..., |^ r _i)} = {](/)) € colL> : (4>\w 0 ) = 0 }. Thus: 


Cmin (D') = 


> 


min 

|it)EcolZ) / 



min 
| it) E col.D 



min 

| u) EcolD: (wo |^)=0 


H n 


l™o) 


D 


Ml 


•^min(Tl). 


min 

|tt)ecolD:(ioo|u)=0 


II W\D 
INII 


By the proof of Theorem 3.10, we have <r m i n (.D) > and by Corollary 1.11, we have 

A(t/'(P,x)t) = A (U’(P,x)) > 2 a min (D') > 2a min (D) > □ 

Combining the last three theorems, we get the following, which has Theorem 2.9 as a special case: 

Theorem 3.12. Fix f : X C [ q] n — > M >0 , and define n(f) = max^gx ■ Let P be any 

span program on [g] n such that X C P 0 (resp. X C Pi), and for all x € X, f(x) = W-(x,P) 
(resp. f(x) = w+(x,P)). Let N = |||wo)II 2 • Then there is a quantum algorithm that estimates f 

to relative accuracy £ using O (^pp \JN f Jxfj (resp. O (^p ) queries. 

Proof. Let P' be the span program that is the same as P, but with target t' = Then it’s clear 
that p= is the minimal positive witness of P' , and furthermore, it has norm 1, so P' is normalized. 
We can similarly see that for any x € Pi, if | w x ) is an optimal positive witness for x in P, then 
-L^\w x ) is an optimal positive witness for x in P', so w + (x, P') = w+ p p> . Similarly, for any x € Pq, 

if u x is an optimal negative witness for x in P , then \fiNaj x is an optimal negative witness for x in 
P', so W-(x,P') = Nw-(x,P). By Theorem 3.10 and 3.11, for all x G X, a([/(p' x )) — K (/) (resp. 
A(U'(P' x)) — K (/))- The result then follows from Theorem 3.9. □ 


21 























4 Applications 


In this section, we will demonstrate how to apply the ideas from Section 3 to get new quantum 

algorithms. Specifically, we will give upper bounds of O(nyfR s .t /) and 0(nyfR Sj t/ A 2 A) 011 

the time complexity of estimating the effective resistance, R s ,t, between two vertices, s and t, in 

a graph. Unlike previous upper bounds, we study this problem in the adjacency model, however, 

there are similarities between the ideas of this upper bound and a previous quantum upper bound 

in the edge-list model due to Wang [Wanl3], which we discuss further at the end of this section. 

—y 

A unit flow from s to t in G is a real-valued function 9 on the directed edges E(G) = {{u,v) : 
{u,v} € E(G)} such that: 

—y 

1. for all (u,v) € E, 9{u,v) = — 9{y,u ); 

2 . for all u € [n] \ {s,f}, Ylve r(u) 0 ( u i v ) = 0) where T(u) = {u € [n] : {u,v} € P}; and 

3- E ue r( s ) 0 ( s > u ) = E«er(t) 0 (“> *) = L 

Let T be the set of unit flows from s to t in G. The effective resistance from s to t in G is defined: 

R s ,t( G ) = min 'V' 9(u,v) 2 . 

{u,v}£E(G) 

In the adjacency model, we are given, as input, a string x E {0, l} nxn , representing a graph 
G x = ([n], {{i,j} '■ x i,j = 1}) ( we assume that x^i = 0 for all i, and Xij = Xj^ for all i,j ). The 
problem of st-connectivity is the following. Given as input x E {0, l} nxn and s,f E [n], decide if 
there exists a path from s to t in G x \ that is, whether or not s and t are in the same component of 
G x . A span-program-based algorithm for this problem was given in [BR12], with time complexity 
0(ny/p), under the promise that, if s and t are connected in G x , they are connected by a path of 
length < p. They use the following span program, defined on {0, l} nxn : 

#(ivT0 = {°}> H (u,v),i = span{\u,v)}, V = M n , A= ^ O) - \v))(u, v\, \t) = |s) - |t). 

u,v£[n] 

We have H = span{|u,u) :«,»£ [n]}, and H(x) = span{|u,u) : {u,v} E E(G X )}. Throughout this 
section, P will denote the above span program. We will use this span program to define algorithms 
for estimating the effective resistance. Ref. [BR12] are even able to show how to efficiently imple¬ 
ment a unitary similar to U(P, x), giving a time efficient algorithm. In Appendix B, we adapt their 
proof to our setting, showing how to efficiently implement [/'(P' 3 ,®) for any < ft < n°^ 

and efficiently construct the initial state \wq), making our algorithms time efficient as well. 

The effective resistance between s and t is related to st-connectivity by the fact that if s and t 
are not connected, then R St t is undefined (there is no flow from s to t) and if s and t are connected 
then R s j is related to the number and length of paths from s to t. In particular, if s and t are 
connected by a path of length p, then R Sjt (G) < p (take the unit flow that simply travels along this 
path). In general, if s and t are connected in G, then j < R s j(G ) < n — 1. The span program for 
st-connectivity is amenable to the task of estimating the effective resistance due to the following. 

Lemma 4.1 ([BR12]). For any graph G x on [n], x € Pi if and only if s and t are connected, and 
in that case, w + {x,P) = \R s ,t(G x ). 

A near immediate consequence of this, combined with Theorem 2.8, is the following. 
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Theorem 4.2. There exists a quantum algorithm for estimating R S ^{G X ) to accuracy e with time 

and 


complexity O 


space complexity O(logn). 


Proof. We merely observe that if G is a connected graph, an approximate negative witness is u : 
[n] —> R that minimizes llcuAn#^) || = Yl{u v y eE (uj(u) — oj{v)) 2 and satisfies w(s) —u(t) = 1. That 
is, c jj is the voltage induced by a unit potential difference between s and t (see [DS84] for details). 
This is not unique, but if we fix ui(s) = 1 and ui(t) = 0, then the oj that minimizes is 

unique, and this is without loss of generality. In that case, for all u € [n], 0 < u(u) < 1, so 

w-(x) = ||u;A || 2 = J2 u ,vG[n]( 0J ( u ) ~ u ( v )) < 2 ™ 2 and thus W_ < 2 n 2 . 


By Theorem 2.8, we can estimate R s j to precision e using O 

to U'(pP,x) for some (3, which, by Theorem B.l, costs O(logra) time and space. □ 

By analyzing the spectra of A and A{x), and applying Theorem 2.9, we can get an often better 
algorithm (Theorem 4.3). The spectral gap of a graph G, denoted A 2 (G), is the second largest 
eigenvalue (including multiplicity) of the Laplacian of G, which is defined Lq = X4e[nl d u \u)(u\ — 
Ylueln] SuGr(n) w h ere d u is the degree of u, and T(u) is the set of neighbours of u. The 

smallest eigenvalue of Lq is 0 for any graph G. A graph G is connected if and only if A 2 (G) > 0. 
A connected graph G has < A 2 (G) < n. 

The following theorem is an improvement over Theorem 4.2 when A 2 (G) > e. In particular, it 
is an improvement for all e when we know that A 2 (G) > 1. 

Theorem 4.3. Let Q be a family of graphs such that for all x E Q, A 2 (G x ) > p. Let f : Q x [n\ x 
[n] —>• R>o he defined by f(x,s,t ) = R. s jfG x ). There exists a quantum algorithm for estimating f 
to relative accuracy e that has time complexity O (Any 7 R s ,t(G x ) / p) and space complexity O(logn). 

Proof. We will apply Theorem 2.9. We first compute |||u;o)|| 2 , in order to normalize P. 

Lemma 4.4. N = |||u>o )|| 2 = A. 

Proof. Since H(x) = H when G x is the complete graph, by Lemma 4.1, we need only compute R S) t 
in the complete graph. It’s simple to verify that the optimal unit sf-flow in the complete graph has 
A units of flow on every path of the form (s, u, t ) for u E [n] \ {s, t}, and A units of flow on the edge 
(s,t). Thus, R s ,t(K n ) = £ue[n]\{ s ,t} 2(l/n ) 2 + (2/n ) 2 = 2/n. Thus ||K )|| 2 = \R s ,t{K n ) = A. □ 

Next, we compute the following: 

Lemma 4.5. For any x € G, = \ l \ Jn \ < \f%> so K (f) < ■»/?• 

y ’ v m in(M x )) y ^2 (Gx) — \f a’ ~ u 

Proof. Let L x denote the Laplacian of G x . We have: 


■\/W-w + (x) \ _ 
^372 


= O 


ny/Rs,t(G x ) 

^372 


calls 


a{x)a{ X ) t = (h - hxh - H) = 2 d n\ u )( u \ mh = 2Lx - 

uE[n] v(ElT(u) uE.[n] u£[n\v€.T(u) 

Thus, if L denotes the Laplacian of the complete graph, we also have AA T = 2 L. Letting J 
denote the all ones matrix, we have L = (n — 1)1 — (J — I) = nl — J, and since J = n|u)(u| where 
|«) = £”=1 |i), if |ui),..., \u n -i), | u) is any orthonormal basis of M n , then L = nYFiZi Wi)(ui\ + 

n\u)(u\ — n\u)(u\ = YFi=i n \ui)(ui\, so the spectrum of L is 0, with multiplicity 1, and n with 
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Figure 1: The graphs in Go contain only the solid edges. The graphs in Gi contain the solid edges 
and one of the dashed edges. We can embed an instance of OR in the dashed edges. If one of the 
dashed edges is included, the number of st-paths increases to 2, decreasing the effective resistance. 

multiplicity n— 1. Thus, the only nonzero singular value of A is \/2n = cr max (T). Furthermore, since 
A 2 (G x ) is the smallest nonzero eigenvalue of L Xl and A(x)A(x) T = 2 L x , cr min (^4(x)) = ^2X 2 (G x ). 
The result follows. □ 

Finally, by Lemma 4.1, we have w + (x,P) = \R s j{G x ), so, applying Theorem 3.12, we get an 

algorithm that makes O ^4^ \J W+ 4' 1 ^ = ® (7 \/ n /Ay/RsjnJ calls to U'(P, x). By Theorem B.l, 

this algorithm has time complexity O (^ny 7 R s ,t/h) and space complexity O(logn). □ 

Both of our upper bounds have linear dependence on n, and the following theorem shows that 
this is optimal. 

Theorem 4.6 (Lower Bound). There exists a family of graphs Q such that estimating effective 
resistance on Q costs at least Gl(n) queries. 

Proof. Let Go be the set of graphs consisting of two stars K l n / 2 -\, centered at s and t, with an 
edge connecting s and t (see Figure 1). Let Gi be the set of graphs consisting of graphs from Go 
with a single edge added between two degree one vertices from different stars. Let G = Go U Gi- 
We first note that we can distinguish between Go and Gi by estimating effective resistance on Q 
to accuracy If G £ Goi then there is a single st-path, consisting of one edge, so the effective 
resistance is 1. If G € Gi, then there are two st-paths, one of length 1 and one of length 3. We 
put a flow of j on the length-3 path and | on the length-1 path to get effective resistance at most 
(3/4) 2 + 3(1/4) 2 = |. 

We now describe how to embed an instance y € {0, l}( n / 2-1 )“ of OR^^-i) 2 i n a graph. We 
let s = 1 be connected to every vertex in {2 ,..., n/2}, and t = n be connected to every vertex in 
{n/2 + 1,... , n — 1}. Let the values of {Gij : i € {2,... , n/2}, j £ {n/2,... , n — 1}} be determined 
by y. Let all other values G/j be 0. Then clearly R s j(G) > 1 if and only if y = 0... 0 (in that case 
G £ Go) and otherwise, R St t(G) <3/4, since there is at least one extra path from s to t (in that case 
G £ G 1 ). The result follows from the lower bound of H(\/ {n/2 — l) 2 ) = fl(n) on OR( n / 2 -i) 2 . □ 

Discussion The algorithms from Theorem 4.2 and 4.3 are the first quantum algorithms for es¬ 
timating the effective resistance in the adjacency model, however, the problem has been studied 
previously in the edge-list model [Wanl3], where Wang obtains a quantum algorithm with com¬ 
plexity O ) where 4>(G) < 1 is the conductance (or edge-expansion) of G. In the edge-list 

model, the input x £ [n]^^ models a d-regular graph (or d-bounded degree graph) G x by x U} i = v 
for some i £ [d] whenever {u, v} £ E(G X ). Wang requires edge-list queries to simulate walking on 
the graph, which requires constructing a superposition over all neighbours of a given vertex. This 
type of edge-list query can be simulated by \fnfd adjacency queries to a d-regular graph, using 
quantum search, so Wang’s algorithm can be converted to an algorithm in the adjacency query 
model with cost O (^ 4(G)' 2 e \/l') ■ We can compare our results to this by noticing that R s j < ^ 2 (G) 
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[CRR + 96], implying that our algorithm always runs in time at most O If G is a con¬ 

nected d-regular graph, then A 2 (G) = d5(G), where 5(G) is the spectral gap of a random walk 
on G. By Cheeger inequalities, we have < 5 [LPW09], so the complexity of the algorithm 
from Theorem 4.3 is at most O ( 73 ^) = O (7 ^§ 7)1 which is an improvement over the bound of 
O \f^j = O (^\/n) given by naively adapting Wang’s algorithm to the adjacency model 

whenever d > ^fn. In general our upper bound may be much better than A^p-, since the Cheeger 
inequality is not tight, and R s j can be much smaller than -p. 

It is worth further discussing Wang’s algorithms for estimating effective resistance, due to 
their relationship with the ideas presented here. In order to get a time-efficient algorithm for 
sf-connectivity, Belovs and Reichardt show how to efficiently reflect about the kernel of A (see 
also Appendix B), A being related to the Laplacian of a complete graph, L, by AA 1 = 2 L. This 
implementation consists, in part, of a quantum walk on the complete graph. Wang’s algorithm 
directly implements a reflection about the kernel of A(x ) by instead using a quantum walk on the 
graph G, which can be done efficiently in the edge-list model. For general span programs, when 
a reflection about the kernel of A(x) can be implemented efficiently in such a direct way, this can 
lead to an efficient quantum algorithm for estimating the witness size. 

We also remark on another quantum algorithm for estimating effective resistance, also from 
[Wanl3] . This algorithm has the worse complexity O ( S* ) > and is obtained by using the HHL 


algorithm [HHL09] to estimate ||A(x) + |t)|| 2 , which is the positive witness size of x, or in this case, 
the effective resistance. We remark that, for any span program, w+(x) = |||u ; ;E )|| 2 = ||A(x) + |r)||", 
so HHL may be another means of estimating the positive witness size. There are several caveats: 
A(x) must be efficiently row-computable, and the complexity additionally depends on , 


the condition number of A(x) (We remark that this is upper bounded by tiff)) ’ upon which 
the complexity of some of our algorithms depends as well). However, if this approach yields an 
efficient algorithm, it is efficient in time complexity, not only query complexity. We leave further 
exploration of this idea for future research. 


5 Conclusion and Open Problems 

Summary We have presented several new techniques for turning span programs into quantum 
algorithms, which we hope will have future applications. Specifically, given a span program P, in 
addition to algorithms for deciding any function / such that / _ 1 (0) C P 0 and / _ 1 (1) C P \, we also 
show how to get several different algorithms for deciding a number of related threshold problems, 
as well as estimating the witness size. In addition to algorithms based on the standard effective 
spectral gap lemma, we also show how to get algorithms by analyzing the real phase gap. 

We hope that the importance of this work lies not only in its potential for applications, but 
in the improved understanding of the structure and power of span programs. A number of very 
important quantum algorithms rely on a similar structure, using phase estimation of a unitary 
that depends on the input to distinguish between different types of inputs. Span-program-based 
algorithms represent a very general class of such algorithms, making them not only important to 
the study of the quantum query model, but to quantum algorithms in general. 

Further Applications The main avenue for future work is in applications of our techniques to 
obtain new quantum algorithms. We stress that any span program for a decision problem can now 
be turned into an algorithm for estimating the positive or negative witness size, if these correspond 
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to some meaningful function, or deciding threshold functions related to the witness size. A natural 
source of potential future applications is in the rich area of property testing problems (for a survey, 
see [MdW13]). 

Span Programs and HHL One final open problem, briefly discussed at the end of the previous 
section, is the relationship between estimating the witness size and the HHL algorithm [HHL09]. 
The HHL algorithm can be used to estimate ||M + |«)|| 2 , given the state |u) and access to a row- 
computable linear operator M. When M = A(x), this quantity is exactly w+(x), so if A(x) is 
row-computable — that is, there is an efficient procedure for computing the nonzero entry of 
the j th row of A(x), then HHL gives us yet another means of estimating the witness size, whose 
time complexity is known, rather than only its query complexity. It may be interesting to explore 
this connection further. 
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A Span Program Scaling 

In this section we prove Theorem 2.14. Let P = ( H,V,t,A ) be any span program on [q] n , and 
let N = ||K>|| 2 for |rco) the optimal positive witness of P. We define P 3 = (H 3 , A 13 ,t 3 , V 3 ) as 
follows. Let |0) and |1) be two vectors orthogonal to H and V. We define: 

Mj G [n], a G [q], H 3 a = H jta , Pf rue = H trae © span{| i)}, P f ^ lse = iL false 0 span{ 10)} 

V 13 = V 0 span{|i)}, A 13 = (3A + r|0) + + N |i)(i|, t 13 = t + \1) 

We then have and H 3 = H 0 span{|0), 11)} and H 3 (x) = H(x) 0 span{|l}}. In order to prove 
Theorem 2.14, we will show that: 

• For all x G Pi, w+(x, P 3 ) = -jpw+{x, P ) + N +pi and w-{x, P 13 ) < j3 2 w-(x, P ) + 2; 

• for all x G Pq, W-(x, P 13 ) = /3 2 W-(x, P) + 1 and w+(x, P 3 ) < -hu> + (x, P) + 2; 
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the smallest witness in P 13 is \wff) = W^/vl^o) + -Jr^ |0) + ,f 11), and \w$) 


Lemma A 


WA-N 1 u/ 1 W+N^' 1 

„ft\ — ft L..-\ j_j _ft 


= 1. 


1. The smallest witness in P 13 is |icq) = p± +n \wq) + |0) + ^^ 2+N I1)• It is easily 


verified that 


W'n 


= 1. 


Proof. Let |uJq) = | h) + 6|0) + c|i) be the smallest witness in P 13 , for some | h) E H. Since 
A ^ 3 1 IC’o) = /3^4| /?■) + bT + C ^ ^" + N _ \ j \ _ _J_ Il\ ,.,o vnviol- noire e — -ft_ 


-p -11) = r + 11), we must have c = ^^ 2+N an d A\h) = ^/r r ! 


so 


| h) = -l- 7 r^|'u;) for some positive witness |u;} of P. We have: 


|K)|| = 


2 ( 1 - 6) 2 . ,2 , / 5 2 


P 2 


k)lr + T + 


/? 2 + iV' 


This is minimized by taking |u>) = |rco), the smallest witness of P, and setting b = N 


Ji+Ni giving: 


\Wn) = 


l^o) + 02 i A7-1^) ^ / oo . „J ^)~ ^ 


P 2 + N 


P 2 + N ^/JP + N 


Lemma A. 2. For all x € Pi, w+(x, P 13 ) = ^w + (x , P ) + and W- {x,pP)<p 2 W-{x,P) + 2 


Proof. The proof is similar to that of Lemma A.l, however, we have H^(x) = H(x) © span{|l)}, 


so a positive witness for x has the form \ w' x ) = \h) + 


VfP+N 


|1) with f3\h) some witness for x in P. 


Clearly |||u/ c )|| is minimized by setting | h) = p\w x ) for \w x ) the minimal positive witness for x in 


P, so we have w + (x,P 13 ) = pW + (x,P) + p+N ' as re( l u i re d. 

Let Cj be an optimal min-error witness for x in P, and define 


uj = 

We have uj'[t + |i)) = 

u'A 13 n 


-uj + 


P 4 


(P 2 + N)w + (x,P) 

P 4 + (P 2 + N)w + (x,P)~ ' P 4 + (P 2 + N)w+(x, P) 

(P 2 + N)w+(x,P) , P 4 


(i| 


l HP{x) 


P 4 + (P 2 + N)w + (x, P) 
(, P 2 + N)w+(x, P) 


uj{t) + 


p 4 + (p 2 + N)w+(x, P) 
2 


= 1, and: 


P 4 + ( P 2 + N)w + (x, P ) 

(P 2 + N) 2 w + (x,P) 2 P 2 1 

(, P 4 + (P 2 + N)w + (x, P)) 2 w + (x, P) 
(P 2 + N) 2 w + (x, P)P 2 + P 6 (P 2 + N ) 


+ 


p 4 

VP 2 + n 

P 4 + ( P 2 + N)w + (x, P) 

P 

P 8 

P 2 + n 

A + (p 2 + N)w+{x,P)) 2 

P 2 

P 2 (P 2 + n) 

1 


(i| 


(p 4 + (P 2 + N)w+(x, P)) 2 P 4 + (P 2 + N)w+(x, P) w + (x,P ft) 

so uJ is a min-error witness for x in P l3 . Thus, letting e = p4+(fiJ^Nyw+(Pp) ’ we ^ iave 


W-(x,P' 3 ) < 


UJ 


A/ 3 


eujfiA + ea}(r)(6| + 




P 


©'(i)<i| 


< /? 2 ||© A || 2 + 1 + 


p 2 + N 


P 8 


p 2 (P 4 + (P 2 +N)w+(x,P)) 2 
< p 2 w-{x, P) + 1 + 13 , < P 2 w+(x,p) + 2, 


(p 4 + P 2 w + (x, P)) 2 
where in the last line, we use the fact that w + (x, P) > N. 


□ 
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Lemma A.3. For all x € Pq, w-(x, P= (3 2 W-(x, P ) + 1, and w+(x, P&) < w+{x, P) + 2. 

Proof. Let oj' x be an optimal negative witness for x in P 13 . Since cj x Il H pr x -\ = 0, u' x |1) = 0, so 
ix' x (t /3 ) = (J x (t) +w^(|i)) = oj' x (t) = 1. Furthermore, u/ x minimizes 


u. 


,A& 


— 11 /^ 4 ^ + o 4 ( t )| 0 )|| —/? 2 ||<* 4 A || + 1 . 


This is minimized by taking uj' x \y to be the minimal negative witness of x in P, so ||o;^A|| 2 = 
w-(x,P ), and thus w-(x,P@) = (3 2 w-(x,P ) + 1. 

Next, let \w) be an optimal min-error positive witness for x in P. Define: 


\w ) := 


/3w-(x, P) 


\w) + 


| 6 > + 


P 


1 + /3 2 w-(x,P)' 1 + f3 2 w-(x,P) 1 y'g 2 + n 


:|i>- 


We have: 


A\w') = 


(3 2 w-(x, P ) 


-r + 


1 + /3 2 w-(x,P) 1 + P 2 w-(x,P) 

and since H^ix) 3 - = H(x) 1 - © span{|0}}: 


r + |1) = r + |1) = t 13 , 




n 


\w 


+ 


H( X y 
(3 2 w-(x, P ) 2 
(1 + /3 2 w-(x , P )) 2 
/ 3 2 w-(x , P) 2 


n,n\|u)') 


n 


H(x) ± \ w ) 

1 


1 


+ 


(1 + /3 2 w-(x, P )) 2 

1 


(1 + /3 2 w-(x, P )) 2 w-(x, P ) (1 + (3 2 w-(x , P)) 2 

1 1 


1 + /3 2 w-(x, P) w-(x,PP)’ 


so \w') has minimal error. Thus: 

~ , d/ 3 n ^ II | ~/\ II2 f3 2 W-(x,P) 2 

w+{x,P p ) < |||«;)|| = 


(1 + {3 2 w-(x, P)) 2 


\w) 


+ 


+ 


P 2 


(1 + (3 2 w-(x, P)) 2 P 2 + N 


< 


P 2 W-(x, P) 2 w + (x, P) 
(1 + P 2 W-(x , P)) 2 


+ 2 < 


P 2 W-(x, P) 2 w + (x , P) 
P 4 W-(x , P) 2 


+ 2 = 


w+(x,P) 

P 2 


+ 2 . □ 


B Time Complexity Analysis 

In [BR12], the authors analyze the time complexity of the reflections needed to implement their 
span program to give a time upper bound on sf-connectivity. Since our algorithms look superficially 
different from theirs, we reproduce their analysis here to show an upper bound on the quantum 
time complexity of estimating effective resistance. 

Theorem B.l. Let P be the span program for st-connectivity given in Section f. Then for any P 
such that l/n °W < P < n 0 ^, U'(P^,x) can be implemented in quantum time complexity O(logn) 
and space O(logn), and |u;q) can be constructed in quantum time complexity O(logn). 
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Proof. In order to implement U'(P^,x), we must implement the reflections R x (/3) = 2H H p^ — / 
and R' P (/3 ) = 2II ker A ,a ffisp (Sjj — I. We remark that R x (f3) is easily implemented in a single query 
and constant overhead. This proof deals with the implementation of R' P (/3 ), which can be easily 
implemented given an implementation of Rp = 2IIker A — I- 

In order to implement Rp, we describe a unitary W = (2TLz — I)(2Hy — I) that can be efficiently 
implemented, and such that W can be used to implement Rp. In order to show that W implements 
Rp, we need to show that some isometry My : H —>• Y maps ker A to the — 1-eigenspace of W, 
and (ker A) -1 " to the 1-eigenspace of W. This allows us to implement Rp by first implementing the 
isometry My, applying W, and then uncomputing My. 

Define the spaces Z and Y as follows: 

( ! ! 1 

Z = span < \z u ) := — , > 10, u, u, v) -|- , > 11, u, v, u) : u € \n] > ; and 

V VT^T)^J J 

Y = span | \y UjV ) := (|0 ,u,u,v) - \l,v,u,v)) /V2 : u,v € [n\,u v| . 

Define isometries 


M z = Y \z u )(u 

«6[n] 


and My = Y \y u , v )(u,v 

(u,v)(E:[n] 2 :u^v 


Lemma B.2. Let S = { My\if ) : \if) € kervl} and S' = {My\if) : \if) € (ker A)- 1 } be the images 
of ker A and (ker A)- 1 - respectively under the isometry My. Then 5 = 7fl Z 1 - , which is exactly the 
intersection ofY and the —1-eigenspace of W, and S' = Y n Z, which is exactly the intersection 
ofY and the 1-eigenspace ofW. 


Proof. We have: 

M ] z My = —jL=^2^2\u)({0,u,u,v\ + (1,u,v,u\) Y (|0,a,a,6) - \l,b, a,b)){a,b\ 


uG[n] v^u 


a,b(E:[n]:a^b 


2 y/n — 




u,v\ — 


uG[n] v^u 


2\Jn — 


x E Z >>( 


u, v\ = 


wG[n] v^u 


2y/ffYTi 


A. 


Thus, for all \if) € keryl, My\if) € Y Pi ker = 7n Z^, soSChfl Z L . On the other hand, if 
\i/j) € (ker A)- 1 , then My\if) Ghfl (ker M^) 1 - =Y(lZ. By Theorem 1.10, the —1-eigenspace of W 
is exactly (Y n Z L ) 0 (T- 1 0 Z) and the 1-eigenspace of W is exactly (Y 0 Z) 0 (y- 1 0 Z - 1 ). □ 

Lemma B.3. My, Rz = 211 z — I and Ry = 2Tly — I can be implemented in time 0(logn). 

Proof. To implement Rz and Ry, we need only show how to implement the unitary versions of Mz 
and My. We begin with Mz- For any u € [n], we can map |u) i— >■ \0,u,u, 0} by initializing three 
new registers and copying u into one of them. Then we map: 


10, u, u, 0} i->- 10, u, u) 


y/n — 1 


E 

V^U 


i . H®I ® 3 


V2(n- 


= | |o, u, u) Y b) + |i, «, «> E b)) 
' \ V^U v^u J 


\x u ), 


where the last transformation is achieved by swapping the last two registers conditioned on the 
first. This can be implemented in O(logn) elementary gates. 
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For My, we start by mapping any edge \u,v) to |l,0,u, v), followed by: 


|1,0, u, v ) H H -j= (10, 0, u, v) - |1,0, u, v)) H- ~^= (|0, u, u, v) - |1, v, u, v)) = \ y u , v ), 


where in the last step we copy either u or v into the second register depending on the value of the 
first register. This can be implemented in 0(1) elementary gates. 

Then in order to implement Rz, we simply apply M' z , reflect about span{|0, u, u, 0) : u € [n]}, 
and then apply Mz again. To implement i?y, we apply My, reflect about span{|l, 0, u, v) : u, v € 
[n], u / v}, and then apply My. □ 

We now show how to efficiently implement the span program P@ when 1 /n°^ < P < n°^\ 
First, consider \wq), the minimal positive witness for P. Since \wq) corresponds to an optimal 
si-flow in the complete graph, it is easy to compute that 


wo) 


1, 1 
n lM) + 2^ 


Y (|s,«) + \u,t)) 


uG[n]\{s,t} 


-|i,s) 

n 


1 

2 n 


Y (i^ u ) + i u > s ))> 

«e[n] 


and |||wo)|| 2 = y (see also Lemma 4.4). We can construct this state by mapping |s,0) + |0,i) i->- 
Ylujts I s ) u ) + \ U P) an d then performing a swap controlled on an additional register in the 

state ^=(|0) + |1)). The initial state of the scaled span program P 13 is (see Theorem 2.14): 



P 


P 2 + 


tK) + 


P 


2 + ± 
1 n 


| 6 ) + 


P 


P‘ 


+ n 


4 ). 


which we can also construct efficiently, as follows: 


|6)-». 


Py/n 


P 2 


rl 2 ) + 


l 


np 2 + 1 


| 6 ) + 


P 


P‘- 




: |i) 


P 


P 2 + i 


I Wo) + 


1 

n 


p 2 + 


tI°) + 


P 


P‘- 


+ b 


: |i). 


The first step is accomplished by a pair of rotations using O(log^) elementary gates, and the 

second is accomplished by mapping |2) to ||jy(yy|j- = v^lwo), which can be accomplished in O(logn) 
elementary gates. 

Next, we have = /3A + (|s) — |i))(0| + ^^ 2 |1)(1|, so 


ker A 13 0 span{|rcg)} = ker A 0 span{|0) 


^K)}0span{|u^)}. 


We know how to reflect about ker A, and since we can efficiently construct |u>q), we can reflect 
about it, so we need only consider how to reflect about span{|0) — ^|u;o)}. Since we can compute 
\wq) efficiently, we can compute: 


|6W 


P 


Jp 2 


:| 6 ) + 


: |i) ^ 


P 


| 6 ) + 


y/^TT v^n ^/W +1 


-1 w Q ). 


The first step is a rotation, which can be performed in O(log^) elementary gates, and the second 
step is some mapping that maps |1) to |u>o), which we know can be done in O(logn) elementary 
gates. Thus, the total cost to reflect about ker A 13 is O(logn). □ 
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